Four-Valued Knowledge Augmentation for Representing Structured Documents

Authors

  • Mounia Lalmas
  • Thomas Roelleke
Abstract

Structured documents are composed of objects with a content and a logical structure. The effective retrieval of structured documents requires models that provide for a content-based retrieval of objects that takes into account their logical structure, so that the relevance of an object is based not solely on its content, but also on the logical structure among objects. This paper proposes a formal model for representing structured documents where the content of an object is viewed as the knowledge contained in that object, and the logical structure among objects is captured by a process of knowledge augmentation: the knowledge contained in an object is augmented with that of its structurally related objects. The knowledge augmentation process takes into account the fact that knowledge can be incomplete and can become inconsistent. The model is based on the definitions of four truth values, modal operators, possible worlds, accessibility relations and truth value assignments used to characterise content knowledge, structural knowledge, augmented knowledge, incompleteness and inconsistency.
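The idea of knowledge augmentation with four truth values can be illustrated with a small sketch. This is not the formal model of the paper, which relies on modal operators, possible worlds and accessibility relations; it only assumes a Belnap-style reading in which a proposition is unknown, true, false or inconsistent, and in which an object's knowledge is combined with that of its sub-objects. All names here (`Truth`, `DocObject`, `join`, `augment`) and the sample propositions are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from enum import Enum


class Truth(Enum):
    """Four truth values, encoded as the set of classical values supported."""
    UNKNOWN = frozenset()                    # no evidence: incompleteness
    TRUE = frozenset({True})
    FALSE = frozenset({False})
    INCONSISTENT = frozenset({True, False})  # conflicting evidence


def join(a: Truth, b: Truth) -> Truth:
    """Combine two pieces of evidence by taking the union of their support."""
    return Truth(a.value | b.value)


@dataclass
class DocObject:
    """Hypothetical document object: its own content plus its sub-objects."""
    name: str
    content: dict[str, Truth] = field(default_factory=dict)
    children: list[DocObject] = field(default_factory=list)


def augment(obj: DocObject) -> dict[str, Truth]:
    """Return obj's knowledge augmented with that of all its descendants."""
    augmented = dict(obj.content)
    for child in obj.children:
        for prop, value in augment(child).items():
            augmented[prop] = join(augmented.get(prop, Truth.UNKNOWN), value)
    return augmented


# Assumed example: an article with two sections that disagree on "sailing".
section1 = DocObject("section1", {"sailing": Truth.TRUE})
section2 = DocObject("section2", {"sailing": Truth.FALSE, "boats": Truth.TRUE})
article = DocObject("article", {"greece": Truth.TRUE}, [section1, section2])

knowledge = augment(article)
```

In this sketch the article inherits "boats" from one section, ends up inconsistent on "sailing" because its sections disagree, and remains incomplete on any proposition that no object mentions.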

Related resources

A Model for Representing and Retrieving Heterogeneous Structured Documents Based on Evidential Reasoning

Documents often display an internal structure; they are composed of components. For example, a journal contains several articles, which themselves contain paragraphs, tables, etc. With structured documents, the retrievable units should be the document components as well as the whole document. The components of a structured document can be of different types: various media, located in a number o...

Representing and Utilising Knowledge for Understanding Structured Documents

This paper presents a document analysis system which is capable of extracting the semantics of specific text portions of structured documents. The main component of the system is the knowledge representation scheme called FRESCO, Frame Representation of Structured Documents. It allows the definition of knowledge about document components as well as knowledge about analysis algorithms in a uni...

A Model for the Representation and Focussed Retrieval of Structured Documents Based on Fuzzy Aggregation

Effective retrieval of structured documents should exploit the content and structural knowledge associated with the documents. This knowledge can be used to focus retrieval to the best entry points: document components that contain relevant information, and from which users can browse to retrieve further relevant components. To enable this, suitable representation methods must be developed. Thi...

A Framework for the Retrieval of Multimedia Objects Based on Four-Valued Fuzzy Description Logics

Knowledge representation, in particular logic, combined together with database and information retrieval techniques may play an important role in the development of so-called intelligent multimedia retrieval systems. In this paper we will present a logic-based framework in which multimedia objects’ medium dependent properties (objects’ low level features) and multimedia objects’ medium independ...

Knowledge Extraction from Semi-structured Data Based on Fuzzy Techniques

In this work we propose a fuzzy technique to compare XML documents belonging to a semi-structured flow and sharing a common vocabulary of tags. Our approach is based on the idea of representing documents as fuzzy bags and, using a measure of comparison, evaluating structural similarities between them. Then we suggest how to organize the extracted knowledge in a class hierarchy, choosing a techn...

Publication date: 2002